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The present paper introduces ad hoc communication networks as examples of large scale real 
networks that can be prospected by statistical means. A description of giant cluster formation 
based on the single parameter of node neighbor numbers is given along with the discussion of some 
asymptotic aspects of the giant cluster sizes. 
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I. INTRODUCTION 

Nowadays, natural and designed networks are in the 
focus of research in different scientific disciplines. Us- 
ing computers the amount of available empirical data on 
real world networks has been increased during the past 
few years. Examples of real networks include the World 
Wide Web [J Q, the Internet 0, & E , collabora- 
tion networks of movie actors and scientists d H El, 
power grid s [llllT^l a nd the metabolic network of living 
organisms |l3Ul4,ll5lll6| . 

Random graphs are natural candidates for the descrip- 
tion of the topology of such large systems of similar units. 
In 0, 0, 0] the authors have developed a model - 
which assumes each pair of the graph's vertices to be con- 
nected with equal and independent probabilities - that 
treats a network as an assembly of equivalent units. 

This model, introduced by the mathematicians Erdos 
and Renyi, has been much investigated in the mathemati- 
cal literature [MEl- 

However, the increasing availability 
of large maps of real-life networks has indicated that the 
latter structures are fundamentally correlated systems, 
and in many respects their topologies deviate from the 
uncorrelated random graph model. 

Two classes of models, commonly called the small- 
world aravhs\n\. Il2t and the scale-free networks\2A 




FIG. 1: Nodes and connections of an example ad hoc network. 
The transmission range is the same for all nodes - it is noted 
by the dotted circle for two of the nodes. The shortest path 
between the s source and d destination users touches 3 inter- 
mediate nodes, and there is an alternative route of 6 hops, 
which has no common intermediate nodes with the first. 
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|24| have been developed to capture the clustering and the 
power law degree distribution present in real networks^, 

S Q MM E3 El El M M EE M M M 

Here we present ad hoc networks |3(1| as new examples 
of real structures that can be investigated using the above 
network models. Ad hoc networks arise in next genera- 
tions of communication systems and hereby we try to 
summarize the principal characteristics of such systems. 
In the ad hoc scheme users communicate by means of 
short range radio devices, which means that every device 
can connect to those devices that are positioned no far- 
ther than a finite maximum geometrical range. We call 
this range the given device's transmission range and the 
exact value of this range may depend on the transmit- 
ter's power and various other physical parameters. See 
Fig. ^ f° r an example ad hoc network topology. Neigh- 
bor nodes talk the way ordinary radios - like CBs - do, 
however communication between non-neighboring users 
is also possible. The latter case is accomplished by send- 
ing the information from the source user to the desti- 
nation hop by hop, through intermediate nodes. If the 
density of users in the area is high compared to their 
transmission ranges, it is highly possible that more than 
one alternative route exists between two users. This last 
feature can be exploited in the case if the shortest route 
is overloaded or broken, or if the system allows splitting 
the information flow into separate parallel flows. More- 
over, the users are free to move randomly and organize 
themselves arbitrarily; thus, the network's topology may 
change rapidly and unpredictably. Such a network may 
operate in a stand-alone fashion, or may be connected to 
the Internet. 

Giant clusters in ad hoc networks are made interesting 
because a communication network provides a meaning- 
ful service only if it integrates as many users as possible 
within the covered area (e.g., 99% may be considered 
a good coverage). In this paper we introduce a fractal 
model, that duplicates the giant component formation in 
ad hoc networks in an area inlaid with obstacles, par- 
tially screening radio transmission. Our main result is 
that in such networks the giant component size can be 
described by a single parameter: the average number of 
neighbors a node has. The rest of this paper is struc- 
tured as follows. Section 1TT1 gives a detailed description 
of our random ad hoc network model. In Sections IIIII 
and IIVI we delve into the topology differences between 
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random graphs and graphs built using our model. Sec- 
tionlVlshows the numerical simulation results supporting 
these analyses. 



II. THE RANDOM AD HOC NETWORK 
MODEL 

A wireless ad hoc network consists of a number of radio 
devices, also referred to as "nodes" in the following. Ev- 
ery node may be connected to one or more other nodes in 
her vicinity; the actual set of connections depends on the 
distance of the nodes. In a static environment these con- 
nections define the topology of the system; if the nodes 
allowed to move then the topology may change, however 
at any given point of time there is still a well defined 
topology available. 

To be precise we define a random ad hoc network as a 
set of uniformly distributed nodes on the arena of the unit 
Euclidean square [0; 1] x [0; 1] with connections between 
pairs of them. The connections are two-way in the sense 
that if node A can communicate to node B, then node B 
is also able to communicate to node A. 

Two nodes are connected if the geometrical distance of 
the two is less than a certain value rt, that is the nodes 
can communicate up to their " transmission range" . We 
represent a realization of such a system using an undi- 
rected graph G(V, E), where the vertices and the edges 
denote the nodes and the two-way connections respec- 
tively. Sometimes a graph resulting this way is referred 
to as a geometric random graph or GRG. Note that there 
are no loops and no multiple edges in G: a) a node should 
not communicate to itself; and b) if two nodes are neigh- 
bors, then technically there is no sense to open a second 
communication channel between them. 

Furthermore, all the length parameters in the system 
are made dimensionless as follows. Length is measured 
as the multiples of the unit radius ro, which is in turn 
defined by the share of the whole area for each node: 



ro := 



A 



(1) 



where A is the size of the arena. The ratio of the trans- 
mission range and the unit radius is called the normalized 
transmission range and noted by: 



n 
ro 



(2) 



As mentioned in the Introduction, a communication 
network may deliver meaningful service only if the net- 
work is connected, or at least has a vast subset that is 
connected. Our work is focused on examining the criteria 
for giant cluster formation and in particular in networks 
with fractal connectivity properties. 

In the following we give a short overview of networks 
on random graphs and afterwards we turn to our model 
of fractal ad hoc connectivity. 



III. CONNECTIVITY IN RANDOM 
NETWORKS 

After distributing and connecting the nodes as de- 
scribed previously, the largest connected component of 
G can be determined. Let S be this components' size 
fraction: 



s 



nodes in the largest component 
N 



which quantity is obtained by counting. This quantity is 
of particular importance because the network gets fully 
connected if S diverges and for this end we are to inves- 
tigate its relationship with other network parameters. 

In the authors present the theory of random 

graphs\]j$ of arbitrary degree distribution. Among oth- 
ers, an exact result for the component sizes is given, 
which we shall cite here. It is shown, that the average 
component size diverges if 



^fc(fc-2)p fc = 



fc=0 

holds, where p^ is the degree distribution of vertices in 
G. Let us use here the actual distribution of our ad hoc 
network: it is easily seen that the probability distribution 
of the number of nodes contained in any disc with radius 
r n is the Poisson-distribution with expectation value of 
r?. It means that 



Pk 



to 

fc! 



(3) 



is the probability that a vertex will have fc — 1 neighbors 
(the —1 is because the node itself does not count for a 
neighbor) . Applying the result in |3l| , one can derive the 
relationship of the size of the giant component S and the 
transmission range: 



log(l - S) 
-S 



(4) 



It shall be noted here that while (0} holds for random 
networks and - as it is to be shown in Section [V] - for 
fractal ad hoc networks, the r n — S relationship is different 
for the finite range ad hoc case, however, the latter is to 
be discussed in a separate paper. 



IV. THE FRACTAL AD HOC NEIGHBORSHIP 
ALGORITHM 

The results of the previous section apply for scenar- 
ios where the arena is " flat" : that is the only limit to 
build a connection between two nodes is their geomet- 
rical distance. In the present section we introduce the 
idea of generalized obstacles that can screen nodes from 
each other even if they are positioned within transmis- 
sion range. This change produces graphs with extended 
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FIG. 2: Giant component sizes for various values of N and 
8. Note that S(N) reaches 1 for 8 < 2, yet 

SmaxCS) < 1 for /3 > 2. 



lim SW,/3) = 



subgraph in G. Clearly, if (C) = 0, then every connected 
component contains a single node, and in the TV — > oo 
limit 5 1 becomes 0. Also, if (C) diverges or even if only a 
single node is connected to all the others, the graph obvi- 
ously gets fully connected. Based on these considerations 
we are to examine (C) in detail. 

Vertex degree in G can be calculated by fixing a sin- 
gle node and totaling the (C r ) expectation value of the 
number of neighbors that reside exactly at the distance 
r away from the fixed one. Assuming that the density of 
nodes is constant (N/A), (C r ) can be expressed by mul- 
tiplying the average number of nodes in distance r and 
the probability J5j|: 



(Cr) 



2m 
~A~ 



Np(r) 



Now if p = N/A, the average vertex degree is 
(C) = f (C r )dr = / p(r)2Trprdr 



(6) 



spatial structure which is why we call the algorithm frac- 
tal. 

The obstacles are adopted by changing the algorithm 
for edge generation. Now two nodes within the transmis- 
sion range will be connected with a probability which is 
given as the function of their geometrical distance. For 
every two nodes u,v e V let p(dist(u,v)) be the proba- 
bility that an edge e uv £ E connecting them is set up. 
For the description of our obstacles we use a long tailed 
probability function which is implied by the picture of 
a hilly landscape, where the possibility of connections 
drops with the increasing geometrical distance between 
the nodes, however long range connections are still pos- 
sible: 



p(r) 



rip) 







(5) 



with parameter values a > and (3 > 0. 

Performing computer simulations of networks con- 
nected according to 10, one obtains different results, as 
(3 changes. On Fig. [5] we compared the resulting giant 
cluster sizes for different (3 values. At lower parameter 
values S(N) saturates to S = 1 - all nodes become el- 
ements of the giant cluster above a certain finite node 
number. For (3 — 2.5 and above S still converges to a 
finite value, however the limit now is strictly less than 
1. It means that networks with such parameter values 
will not become fully connected even at large node num- 
bers, moreover, the proportion of the largest connected 
subgraphs drops with (3 worse than linearly. In the rest 
of this Section we try to interpret this dual behavior of 
S(f3). 

It is easy to imagine that the more connections the 
nodes have in average, the larger the giant cluster grows. 
More accurately we state that the average vertex degree 
(C) determines the cardinality of the largest connected 



where 21 represents the physical boundaries of the arena. 
As there are no nodes outside this region, thus the inte- 
gral shall be outside 21. 
In general solving © yields 



(C) = a2irp 



ro(3 
1-/3 

r 

'^8 



1-/3 



ro(3 
2-/3 



r 



1-0 



(7) 



J 21 



However the expectation value of (C) is dependent on 
the value of (3. Accordingly, our discussion is separated 
into several cases. 

a (3 > 2. In this case J7J) can be evaluated for 21 being 
the interval r £ [0; oo) in the limit where rp — * 0: 



(C) = 



a2np ■ rl(3 2 



2af3 2 



(1-0X2-/3) 09-1)08 
Furthermore, knowing that 
1 



(8) 



lim , 

«^oo (1 + x) a 

in the (3 — > oo limit JSJ becomes 
(C) = a2ir-prl = 2a 

b/3 = lor/3 = 2. Q diverges logarithmically in r, 
thus C does not have an expectation value. 

c (3 < 2 and (3^1. Here (C) will diverge as N — > 
oo, however unlikely to the previous case we try to 
determine the (C(N)) relation. First let us rewrite 
(0 as 



(C) 



a2?:p ■ r /3 
(l-/3)(2-/3) 



r(l-/3)-r /3 

d + -^-)f } - 1 



(9) 
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Concerning the r-dependence in [. . we can as- 
sume that there is a maximal transmission range 
''max such that for transmission ranges r > r max 
the contribution of the integrand in © is negligi- 
ble. This way the [. . .]<a part of © can be estimated 
as 



rod 



(1-/3) 



roll 



13-1 



rofi 



(3-1 



(10) 



Now if ro — > (which happens to be the case at 
sufficiently large node numbers) the first term in 
(|10JI vanishes and the +1 becomes negligible in the 
denominator of the second term. After substituting 
this second term and simplifying the expression, (J7J) 
finally becomes 



(C) 



a2irp ( roP \ ^ 
2^ 



The TV-dependence of (C) can be derived from here 
by substituting definition 0J, p = N/A and the 
fact that r^ ax oc A. By these means the above 
expression yields: 



(C) cx 7V 1 " 



(11) 



To summarize, if (3 > 2, then a finite neighbor count 
is expected, and thus such networks are not going to be 
fully connected (see again Fig.EJ). On the other hand, if 
(3 < 2, then (C) diverges exponentially with increasing 
node numbers, which in theory leads to fully connected 
networks at large N , and means, that the more nodes are 
in the system, the larger the fraction of connected nodes 
is to become. 



V. SIMULATION RESULTS 

We carried out computer simulations to illustrate our 
findings, especially Eqs. JHJ) and Qllfl. During a simula- 
tion run we first pick the random coordinates for the N 
nodes. Second the probability p is calculated according 
to J5j|, using the input parameters a,/3 and r. Then for 
every two nodes a uniform random number £ S [0; 1] is 
generated and compared to p: for cases £ < p an edge 
connecting those two nodes is recorded. Finally we count 
the component sizes and take the largest of these. The 
output of the simulation run is the average vertex de- 
gree, (C), and the largest components' size, S. Note the 
analogy with © if using the fact, that here (C) = r^. 

As the first test we recorded the giant cluster size vs. 
transmission range relationship. Data points were ob- 
tained by repeated runs, changing only the amplitude pa- 
rameter a of (JjjJ in an appropriate interval (e.g., a £ [1; 9] 
for the (3 = 2.5 case). Fig. illustrates that in a net- 
work connected using the fractal neighborship algorithm 
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FIG. 3: Simulations of ad hoc networks using the fractal 
neighborship algorithm with parameter values both f3 < 2 
and f3 > 2 yield the same giant cluster size vs. average vertex 
degree as random graphs. Inset displays the same plots with 
— log(l — S)/S as the abscissa. 
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FIG. 4: Average vertex degree of ad hoc graphs for f3 > 2. 
Data points were acquired using a = 0.8, (3 = 5.6; the dashed 
line yields the analytical result (C) = 3.03, which shall hold 
in the N — > oo limit. 



the observable S — (C) relationship matches the equiva- 
lent analytical result for random graphs for both relevant 
cases a and c in Section llVI 

On the other hand, the behavior of (C) turns out to 
be sensible to the value of /3, as it was expected. Let us 
start with the case (3 > 2. Fig. 0] presents the simulation 
results for networks connected as by JSJ), using a — 0.8 
and (3 = 5.6. According to JSJ, the average vertex degree 
is expected to be 



(C) = 



2-5.6 2 



3.03 



4.6-3.6 

in this case. It is clearly seen on the Figure that increas- 
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FIG. 5: Divergence of average vertex degrees with N for /3 < 
2. Crosses: data points for a = 0.1, (3 = 1.56; dashed line is 
oc jV 1 -* 3 / 2 = jV - 22 . Inset displays the same plot with both 
axes logarithmic. 



ing N, the simulation output converges to the analytical 
result. 

Now let us turn to the < (3 < 2 case. On Fig. [5] the 
data obtained for a = 0.1 and (3 = 1.56 is shown along 
with a numeric function fit according to 1)11(1 : 



Figs. 01 and [3] now illustrate the differing 5-behavior 
presented on Fig. [3 as a data set for (3 = 5.6 would 
converge to some connectivity < 20% even at very large 
N, while the one for (3 = 1.56 is clearly reaching 5=1 
for node numbers in the magnitude of several thousands. 



VI. CONCLUSIONS 

In the present paper we have investigated the con- 
nected components that are produced in random ad hoc 
networks. Based on the results, the number of nodes 
needed for a given connectivity ratio can be estimated. 
Thus our results may hint about the usefulness of random 
fractal ad hoc networks. 

We modified the conventional connection function and 
made long range connections possible. This way the pro- 
ducing networks become extended in their spatial struc- 
ture, as thought the network is situated in an area with 
obstacles screening some of the transmissions. We have 
found that a single parameter - the average neighbor 
count (C) - can characterize the proportion of the largest 
connected subnetwork. We have also seen that depending 
on the connection function parameters, this proportion 
can be either bounded or unbounded as the system size 
N is increased. For both cases (C(N)) was derived ana- 
lytically and confirmed by simulations. 



(C(N)) ^co-N 1 -^ + C1 

(the parameters turn out to be cq = 0.74 and c\ — 
— 1.38). The simulations agree with the N l ~®l 2 diver- 
gence well, as calculated in Section llVl 
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